Differentially Private Policy Evaluation (Supplementary Material)
Abstract
Proof. Fix a pair of neighbouring datasets $X \simeq X'$ and let $E \subseteq \mathbb{R}$ be any measurable set. Let $\Theta_{X,X'}$ be as in the statement and write $\bar{\Theta}_{X,X'} = \mathbb{R} \setminus \Theta_{X,X'}$ for its complement. Using the assumptions on $\Theta_{X,X'}$ we see that

$$\mathbb{P}[\theta_X \in E] = \mathbb{P}[\theta_X \in E \cap \Theta_{X,X'}] + \mathbb{P}[\theta_X \in E \cap \bar{\Theta}_{X,X'}] \leq e^{\varepsilon}\, \mathbb{P}[\theta_{X'} \in E \cap \Theta_{X,X'}] + \delta \leq e^{\varepsilon}\, \mathbb{P}[\theta_{X'} \in E] + \delta .$$

Now we proceed with the proof of Lemma 1. Let $X \simeq X'$ be two neighbouring datasets and let us write $Z_1 = Z_X$ and $Z_2 = Z_{X'}$ for simplicity. Thus, for $i = 1, 2$, the variables $Z_i \sim \mathcal{N}(\mu_i, \sigma_i^2 I)$ are $d$-dimensional independent Gaussian random variables whose means and variances satisfy the assumptions of Lemma 1 for some $\varepsilon, \delta > 0$. The density function of $Z_i$ is denoted by $f_{Z_i}(z)$. In order to be able to apply Lemma 2 we want to show that the privacy loss between $Z_1$ and $Z_2$, defined as

$$L(z) = \ln \frac{f_{Z_1}(z)}{f_{Z_2}(z)} \tag{1}$$
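As a rough numerical illustration of the privacy loss in equation (1), the sketch below evaluates L(z) = ln f_{Z1}(z) - ln f_{Z2}(z) for two isotropic Gaussians and estimates by Monte Carlo how often it exceeds a threshold ε when z is drawn from Z1. The function names, the parameter values, and the choice to look at the event L(z) > ε are assumptions made for illustration only; the excerpt does not state the exact conditions of Lemma 1 or Lemma 2.

```python
import math
import random


def log_density(z, mu, sigma):
    """Log-density of N(mu, sigma^2 I) at the point z (lists of equal length)."""
    d = len(z)
    sq_dist = sum((zi - mi) ** 2 for zi, mi in zip(z, mu))
    return -0.5 * d * math.log(2.0 * math.pi * sigma ** 2) - sq_dist / (2.0 * sigma ** 2)


def privacy_loss(z, mu1, sigma1, mu2, sigma2):
    """L(z) = ln f_{Z1}(z) - ln f_{Z2}(z), following equation (1)."""
    return log_density(z, mu1, sigma1) - log_density(z, mu2, sigma2)


def estimate_tail(mu1, sigma1, mu2, sigma2, eps, n_samples=20000, seed=0):
    """Monte Carlo estimate of P[L(Z1) > eps] with Z1 ~ N(mu1, sigma1^2 I)."""
    rng = random.Random(seed)
    exceed = 0
    for _ in range(n_samples):
        # Draw one d-dimensional sample from Z1, coordinate by coordinate.
        z = [rng.gauss(m, sigma1) for m in mu1]
        if privacy_loss(z, mu1, sigma1, mu2, sigma2) > eps:
            exceed += 1
    return exceed / n_samples


# Hypothetical parameters, chosen only for illustration (not from the paper).
mu1, sigma1 = [0.0, 0.0, 0.0], 1.0
mu2, sigma2 = [0.1, 0.0, -0.1], 1.05
eps = 0.5
tail = estimate_tail(mu1, sigma1, mu2, sigma2, eps)
print(f"estimated P[L(Z1) > {eps}] = {tail:.4f}")
```

If the estimated tail probability is small, it plays the role of δ in a bound of the form P[θ_X ∈ E] ≤ e^ε P[θ_X′ ∈ E] + δ, under the assumption that the set Θ_{X,X′} is taken to be the region where the loss is at most ε.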
Similar resources
Differentially Private Policy Evaluation
We present the first differentially private algorithms for reinforcement learning, which apply to the task of evaluating a fixed policy. We establish two approaches for achieving differential privacy, provide a theoretical analysis of the privacy and utility of the two algorithms, and show promising results on simple empirical examples.
Inverse Reinforcement Learning with Simultaneous Estimation of Rewards and Dynamics - Supplementary Material
This document contains supplementary material to the paper Inverse Reinforcement Learning with Simultaneous Estimation of Rewards and Dynamics with more detailed derivations, additional proofs to lemmata and theorems as well as larger illustrations and plots of the evaluation task. 1 Partial Derivative of the Policy
MCBS Highlights: Ownership and Average Premiums for Medicare Supplementary Insurance Policies
This article describes private supplementary health insurance holdings and average premiums paid by Medicare enrollees. Data were collected as part of the 1992 Medicare Current Beneficiary Survey (MCBS). Data show the number of persons with insurance and average premiums paid by type of insurance held--individually purchased policies, employer-sponsored policies, or both. Distributions are show...
Supplementary Appendix for Informative Cheap Talk in Elections
This Supplementary Appendix formalizes two extensions of the baseline model discussed in Section 5 of the main text: we allow candidates to have some private information about the state of the world when campaigning (Supplementary Appendix A); and we consider more than two policy-preference types and actions (Supplementary Appendix B). A. Pre-election Private Information about the State A.1. Mo...
Evaluation of Public Servants’ Acceptability of Public-Private Partnership in Housing Delivery for Low-Income Public Servants in Akure, Nigeria
Nigeria has had several housing programmes and policies geared towards the provision of housing for her citizens, from the colonial era to the post-colonial period. The Nigerian Government had always been directly involved in the provision of housing for the public servants and with the advent of the public-private partnership initiative, the low-income public servants’ acceptability of this new housin...